Picture for Zhaoxiang Zhang

Zhaoxiang Zhang

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

Add code
Jun 01, 2026
Viaarxiv icon

MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

Add code
May 27, 2026
Viaarxiv icon

GoClick: Lightweight Element Grounding Model for Autonomous GUI Interaction

Add code
Apr 27, 2026
Viaarxiv icon

AutoGUI-v2: A Comprehensive Multi-Modal GUI Functionality Understanding Benchmark

Add code
Apr 27, 2026
Viaarxiv icon

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Add code
Apr 20, 2026
Viaarxiv icon

CodeTracer: Towards Traceable Agent States

Add code
Apr 14, 2026
Viaarxiv icon

ReinDriveGen: Reinforcement Post-Training for Out-of-Distribution Driving Scene Generation

Add code
Apr 01, 2026
Viaarxiv icon

DynVLA: Learning World Dynamics for Action Reasoning in Autonomous Driving

Add code
Mar 11, 2026
Viaarxiv icon

GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generatio

Add code
Feb 24, 2026
Viaarxiv icon

FeatureBench: Benchmarking Agentic Coding for Complex Feature Development

Add code
Feb 11, 2026
Viaarxiv icon